Vectors and compounds for expression of zymogen forms of human protein C

ABSTRACT

A method for the recombinant production of zymogen forms of human protein C is described. These zymogen forms differ from native zymogen protein C in their increased sensitivity to activation by thrombin and thrombin/thrombomodulin. DNA compounds, vectors, and transformants useful in the method are also disclosed.

SUMMARY OF THE INVENTION

The present invention provides novel DNA compounds and recombinant DNA cloning vectors that encode novel zymogen forms of human protein C. These zymogens can be activated in vivo by thrombin alone at a rate of clinical significance and are much more susceptible to activation by thrombin/thrombomodulin than native protein C zymogen. The expression vectors provide a simple and efficient means for expressing these human protein C zymogens in recombinant host cells. Native human protein C zymogens require treatment with high levels of thrombin, or thrombin and thrombomodulin, or other expensive enzymes for activation. The present invention provides a method for producing zymogen forms of human protein C that serve as much better substrates for thrombin and consequently can be activated in the presence of lower levels of thrombin, or thrombin/thrombomodulin, or other enzymes. Most importantly, the zymogen forms of human protein C of the invention can be activated by thrombin even in the presence of physiological Ca²⁺, which is inhibitory to the activation of native protein C zymogen by thrombin. The novel zymogen forms of human protein C differ from those known in the art in the amino acid residue sequences in the region of the activation peptide, which is removed from the zymogen forms to produce activated human protein C. These novel zymogen forms of protein C offer special advantages in the treatment of blood disorders involving coagulation.

BACKGROUND OF THE INVENTION

The Role of Protein C in the Regulation of Blood Coagulation

Protein C, a vitamin K dependent plasma protein, is of major physiological importance in the control of hemostasis. Protein C is synthesized as an inactive molecule, herein called nascent protein C. Nascent protein C undergoes complex processing, giving rise to a number of different inactive molecules as is more fully described below. Inactive, secreted forms of protein C are referred to herein as zymogen protein C. Activation of protein C occurs in the blood by a reaction involving a thrombomodulin-thrombin complex. Activated protein C, together with its cofactor protein S, is an anticoagulant of important physiological significance. Activated protein C can prevent intravascular thrombosis and control the extension of existing clots. The mechanism of action of the activated form of protein C and the mechanism of activation of the inactive zymogen into the active protease have been clarified in recent years (for review, see J. E. Gardiner and J. H. Griffin, Progress in Hematology, Vol. XIII, pp. 265-278, ed. Elmer B. Brown, Grune and Stratton, Inc., 1983, and Esmon, N. L., 1989, Prog. Hemost. Thromb. 9:29-55).

The activation of protein C involves thrombin, the final serine protease in the coagulation cascade, and an endothelial cell membrane-associated glycoprotein called thrombomodulin. Thrombomodulin forms a tight, stoichiometric complex with thrombin. Thrombomodulin, when complexed with thrombin, dramatically changes the functional properties of thrombin. Thrombin normally clots fibrinogen, activates platelets, and converts clotting cofactors V and VIII to their activated forms, Va and VIIIa. Finally, thrombin activates protein C, but only very slowly and inefficiently, and the activation is further inhibited by physiological Ca²⁺. In contrast, thrombin complexed with thrombomodulin does not clot fibrinogen, activate platelets, or convert clotting factors V and VIII to their activated counterparts Va and VIIIa, but does become a very efficient activator of protein C zymogen in the presence of physiological Ca²⁺. The rate constant of protein C zymogen activation by thrombomodulin-thrombin is over 1,000 fold higher than the rate constant for thrombin alone.

To understand how activated protein C down-regulates blood coagulation, the following brief description of the coagulation enzyme system is provided. The coagulation system is best viewed as a chain reaction involving the sequential activation of zymogens into active serine proteases. This chain reaction eventually produces the enzyme thrombin, which through limited proteolysis converts plasma fibrinogen into the insoluble gel fibrin. Two key events in the coagulation cascade are the conversion of clotting factor X to Xa by clotting factor IXa and the conversion of prothrombin into thrombin by clotting factor Xa. Both of these reactions occur on cell surfaces, most notably the platelet surface, and both reactions require cofactors. The major cofactors in the system, factors V and VIII, circulate as relatively inactive precursors, but when the first few molecules of thrombin are formed, thrombin loops back and activates the cofactors through limited proteolysis. The activated cofactors, Va and VIIIa, accelerate both the conversion of prothrombin into thrombin and the conversion of factor X to factor Xa by approximately five orders of magnitude. Activated protein C preferentially acts on clotting cofactors Va and VIIIa, the activated forms of the inactive clotting factors V and VIII, proteolytically degrading, hydrolyzing, and irreversibly destroying them. Clotting factors V and VIII, in contrast, are very poor substrates for activated protein C in vivo.

An important cofactor for activated protein C is protein S, another vitamin K-dependent plasma protein. Protein S increases activated protein C-mediated hydrolysis of factors Va and VIIIa 25-fold.

Protein C as a Therapeutic Agent

Protein C is recognized as a valuable therapeutic agent (see, for example, Bang et al., U.S. Pat. No. 4,775,624, issued Oct. 4, 1988, the teaching of which is incorporated herein by reference). Activated protein C is a novel antithrombotic agent with a wider therapeutic index than available anticoagulants, such as heparin and the oral hydroxycoumarin type anticoagulants. Neither zymogen protein C nor activated protein C is effective until thrombin is generated, because thrombin is needed to convert clotting factors V to Va and VIII to VIIIa; the activated forms of these two cofactors are the preferred substrate for activated protein C. Thrombin is also required to activate zymogen protein C, for without the thrombomodulin-thrombin complex, the protein C zymogen is not efficiently converted into its active counterpart.

Activated protein C is an on-demand anticoagulant, because activated protein C works by inactivating cofactors Va and VIIIa. Because thrombin is required to convert factors V and VIII to their activated counterparts Va and VIIIa, protein C only acts as an anticoagulant after thrombin is generated. Conventional anticoagulants, in contrast to activated protein C, maintain a constant anticoagulant state throughout the circulation for as long as they are given to the patient, thereby substantially increasing the risk of bleeding complications over that for protein C or activated protein C. Activated protein C is therefore an on-demand anticoagulant of wide clinical utility for use as an alternative to heparin and the hydroxycoumarins.

In some disease states, such as hereditary protein C deficiency, protein C zymogen is of great therapeutic importance. In congenital homozygous protein C deficiency, affected individuals die in early childhood from purpura fulminans, an often lethal form of disseminated intravascular coagulation. In heterozygous protein C deficiency, affected individuals suffer severe, recurrent thromboembolic episodes. It is well established clinically that plasma protein concentrates designed to treat hemophilia B or factor IX deficiency, which contain protein C as an impurity, are effective in the prevention and treatment of intravascular clotting in heterozygous protein C deficiency. Protein C levels have also been noted to be abnormally low in thrombotic states such as disseminated intravascular coagulation and in disease states predisposing to thrombosis, such as major trauma, major surgery, and cancer.

The Synthesis and Activation of Human Protein C

To facilitate an understanding of the activation of protein C and the invention, the coding sequence, and corresponding amino acid residue sequence, for nascent human protein C is depicted below. This amino acid residue sequence, and relevant portions thereof, also characterizes "native human protein C" for purposes of the present invention. ##STR1## wherein A is deoxyadenyl, G is deoxyguanyl, C is deoxycytidyl, T is thymidyl, ALA is Alanine, ARG is Arginine, ASN is Asparagine, ASP is Aspartic acid, --COOH is the carboxy terminus, CYS is Cysteine, GLN is Glutamine, GLU is Glutamic Acid, GLY is Glycine, HIS is Histidine, H₂N-- is the amino terminus, ILE is Isoleucine, LEU is Leucine, LYS is Lysine, MET is Methionine, PHE is Phenylalanine, PRO is Proline, SER is Serine, THR is Threonine, TRP is Tryptophan, TYR is Tyrosine, and VAL is Valine.

The DNA sequence depicted above was derived from cDNA clones prepared from human liver mRNA that encodes human protein C. Those skilled in the art recognize that the degenerate nature of the genetic code enables one to construct many different DNA sequences that encode the same amino acid residue sequence. The cDNA sequence for nascent human protein C depicted above is thus only one of many possible nascent human protein C-encoding sequences. In constructing the cDNA clones, a 5' poly G sequence, a 3' poly C sequence, and both 5' and 3' PstI restriction enzyme recognition sequences were constructed at the ends of the protein C-encoding cDNA. Two of these cDNA clones were manipulated to construct a DNA molecule comprising both the coding sequence of nascent human protein C and also portions of the DNA encoding the untranslated mRNA at the 5' and 3' ends of the coding region. This DNA molecule was inserted into the PstI site of plasmid pBR322 to construct plasmid pHC7. Plasmid pHC7 thus comprises the coding sequence above and, again depicting only one strand of the molecule, also contains these additional sequences: ##STR2## at the 5' and 3' ends, respectively, of the coding strand of the nascent human protein C coding sequence. Due to the complementary nature of DNA base-pairing, the sequence of one strand of a double-stranded DNA molecule is sufficient to determine the sequence of the opposing strand. Plasmid pHC7 can be conventionally isolated from E. coli K12 RR1/pHC7, a strain deposited with and made part of the permanent stock culture collection of the Northern Regional Research Laboratory (NRRL), Peoria, Ill. A culture of E. coli K12 RR1/pHC7 can be obtained from the NRRL under the accession number NRRL B-15926.
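The two general points made above, that one strand determines the opposing strand and that the degenerate genetic code allows many DNA sequences to encode the same residues, can be illustrated with a minimal Python sketch. The function names are hypothetical and are not part of the disclosure; the codon counts are those of the standard genetic code.

```python
from typing import Sequence

# Watson-Crick base pairing for DNA.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

# Number of codons per amino acid in the standard genetic code (61 sense codons).
CODON_COUNT = {
    "LEU": 6, "SER": 6, "ARG": 6,
    "ALA": 4, "GLY": 4, "PRO": 4, "THR": 4, "VAL": 4,
    "ILE": 3,
    "PHE": 2, "TYR": 2, "HIS": 2, "GLN": 2, "ASN": 2,
    "LYS": 2, "ASP": 2, "GLU": 2, "CYS": 2,
    "MET": 1, "TRP": 1,
}


def opposing_strand(strand: str) -> str:
    """Return the opposing strand of a DNA strand, written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(strand.upper()))


def count_encoding_sequences(residues: Sequence[str]) -> int:
    """Count the distinct DNA coding sequences for a run of residues."""
    total = 1
    for residue in residues:
        total *= CODON_COUNT[residue.upper()]
    return total


if __name__ == "__main__":
    print(opposing_strand("ATGTGG"))                 # CCACAT
    print(count_encoding_sequences(["LYS", "ARG"]))  # 12
```

Even the LYS-ARG dipeptide alone can be encoded by 12 different hexanucleotides, which is why the depicted cDNA is only one of many possible coding sequences.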

Nascent protein C can also be depicted schematically, as shown below. ##STR3##

pre-pro--amino acid residues 1-42 of nascent human protein C constitute the signal peptide and propeptide of human protein C, important for directing secretion and γ-carboxylation of protein C.

LC--amino acid residues 43-197 of nascent protein C, once post-translationally modified, constitute the light chain (LC) of both the two-chain zymogen (formed from one-chain zymogen by removal of the KR dipeptide, as discussed below) and activated forms of protein C.

KR--amino acid residues 198-199 of nascent human protein C; these residues are believed to be removed (on the basis of homology with bovine protein C), probably by a two-step process comprising a first cleavage (either between residues 197-198 or 199-200) followed by carboxypeptidase or aminopeptidase action, to form two-chain protein C.

AP--amino acid residues 200-211 of nascent protein C constitute the activation peptide, which is removed from the zymogen forms of protein C to obtain activated protein C.

AHC--amino acid residues 212-461 of nascent protein C, once post-translationally modified, constitute the activated heavy chain (AHC) of active protein C.

HC--the heavy chain of the two chain form of protein C zymogen, once post-translationally modified, is composed of amino acid residues 200-461, the AP and AHC.
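The residue ranges listed above can be checked for consistency with a short Python sketch. The ranges are transcribed from the definitions above; the dictionary and function names are hypothetical and serve only as an illustration.

```python
# Residue ranges (1-based, inclusive) in nascent human protein C,
# transcribed from the schematic definitions above.
DOMAINS = {
    "pre-pro": (1, 42),
    "LC": (43, 197),
    "KR": (198, 199),
    "AP": (200, 211),
    "AHC": (212, 461),
}


def domain_length(name: str) -> int:
    """Return the number of residues in the named region."""
    start, end = DOMAINS[name]
    return end - start + 1


def domain_of(position: int) -> str:
    """Return the region that contains a residue position."""
    for name, (start, end) in DOMAINS.items():
        if start <= position <= end:
            return name
    raise ValueError(f"position {position} is outside residues 1-461")


def is_contiguous() -> bool:
    """Check that the regions tile residues 1-461 with no gaps or overlaps."""
    expected_start = 1
    for start, end in DOMAINS.values():
        if start != expected_start:
            return False
        expected_start = end + 1
    return expected_start == 462


if __name__ == "__main__":
    print(is_contiguous())                               # True
    print(domain_length("AP"))                           # 12
    print(domain_length("AP") + domain_length("AHC"))    # 262, the HC
    print(domain_of(214))                                # AHC
```

The heavy chain (HC) of the two-chain zymogen is the AP plus the AHC, so it is derived here by addition and not stored separately.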

Human protein C zymogen is a serine protease precursor synthesized in the liver and present in the blood. For expression of complete biological activity, protein C requires post-translational modifications for which vitamin K is needed. The two-chain, disulfide-linked, protein C zymogen arises from the single-chain zymogen by limited proteolysis. This limited proteolysis is believed to include cleavage and removal of amino acid residues 198 and 199. The activation of the two-chain zymogen into the active serine protease involves the proteolytic cleavage of an ARG-LEU peptide bond (residues 211 and 212). This latter cleavage releases a dodecapeptide (residues 200-211), the activation peptide, that constitutes the amino-terminus of the larger (heavy) chain of the two-chain zymogen molecule. Protein C is significantly glycosylated; the mature enzyme from plasma contains 15-23% carbohydrate. Protein C also contains a number of unusual amino acids, including γ-carboxyglutamic acid and β-hydroxyaspartic acid (erythro-L-β-hydroxy aspartate). γ-carboxyglutamic acid (gla) is produced by γ-glutamyl carboxylation from glutamic acid residues with the aid of a hepatic microsomal carboxylase which requires vitamin K as a cofactor.

The activation of human protein C can also be represented schematically and is shown below. Those skilled in the art recognize that the order of the steps shown in the schematic does not necessarily reflect the order of the steps in the in vivo pathway. ##STR4##

The present invention provides novel compounds, vectors, transformants, and methods for the recombinant expression of novel protein C zymogens.

Definitions

For purposes of the present invention, as disclosed and claimed herein, the following terms are as defined below.

Ad2LP--the major late promoter of adenovirus type 2.

Amino acid residues in proteins or peptides described herein are abbreviated as follows:

    ______________________________________________
    Three-Letter                      One-Letter
    Abbreviation  Amino Acid Residue  Abbreviation
    ______________________________________________
    PHE           Phenylalanine       F
    LEU           Leucine             L
    ILE           Isoleucine          I
    MET           Methionine          M
    VAL           Valine              V
    SER           Serine              S
    PRO           Proline             P
    THR           Threonine           T
    ALA           Alanine             A
    TYR           Tyrosine            Y
    HIS           Histidine           H
    GLN           Glutamine           Q
    ASN           Asparagine          N
    LYS           Lysine              K
    ASP           Aspartic Acid       D
    GLU           Glutamic Acid       E
    CYS           Cysteine            C
    TRP           Tryptophan          W
    ARG           Arginine            R
    GLY           Glycine             G
    ______________________________________________

ApR--the ampicillin-resistant phenotype or gene conferring same.

BK--DNA from BK virus.

Enh or enhancer--the enhancer of BK virus.

ep or SV40ep--a DNA segment comprising the SV40 early promoter of the T-antigen gene, the T-antigen binding sites, the SV40 enhancer, and the SV40 origin of replication.

γ-carboxylation--a reaction which adds a carboxyl group to glutamic acids at the γ-carbon.

γ-carboxylated protein--a protein in which some glutamic acid residues have undergone γ-carboxylation.

GBMT transcription unit--a modified transcription control unit comprising the P2 enhancer of BK virus spaced closely to the upstream regulatory element of the major late promoter of adenovirus, the adenovirus-2 major late promoter, a poly-GT element positioned to stimulate said promoter and a DNA sequence containing the spliced tripartite leader sequence of adenovirus. The GBMT transcription unit is found on an approximately 900 base pair HindIII restriction fragment of plasmid pGT-h.

IVS--DNA encoding an intron, also called an intervening sequence.

MMTpro--the promoter of the mouse metallothionein-I gene.

Nascent protein--the polypeptide produced upon translation of a mRNA transcript, prior to any post-translational modifications. However, post-translational modifications such as γ-carboxylation of glutamic acid residues and hydroxylation of aspartic acid residues may begin to occur before a protein is fully translated from an mRNA transcript.

NeoR--a neomycin resistance-conferring gene, which can also be used to confer resistance to the antibiotic G418.

pA--a DNA sequence encoding a polyadenylation signal.

Promoter--a DNA sequence that directs transcription of DNA into RNA.

Protein C activity--any property of human protein C responsible for proteolytic, amidolytic, esterolytic, and biological (anticoagulant or profibrinolytic) activities. Methods for testing for protein anticoagulant activity are well known in the art; see, for example, Grinnell et al., 1987, Biotechnology 5:1189.

Recombinant DNA Cloning Vector--any agent, including, but not limited to, chromosomally integrating agents, autonomously replicating plasmids, and phages, comprising a DNA molecule to which one or more additional DNA segments can be or have been added.

Recombinant DNA Expression Vector--any recombinant DNA cloning vector into which a promoter has been incorporated and positioned to drive expression of a gene product.

Recombinant DNA Vector--any recombinant DNA cloning or expression vector.

Replicon--a DNA sequence that controls and allows for autonomous replication of a plasmid or other vector.

Restriction Fragment--any linear DNA sequence generated by the action of one or more restriction endonuclease enzymes.

Sensitive Host Cell--a host cell that cannot grow in the presence of a given antibiotic or other toxic compound without a DNA segment that confers resistance thereto.

TcR--the tetracycline-resistant phenotype or gene conferring same.

Transformation--the introduction of DNA into a recipient host cell that changes the genotype of the recipient cell.

Transformant--a recipient host cell that has undergone transformation.

Translational Activating Sequence--any DNA sequence, inclusive of that encoding a ribosome binding site and translational start codon, such as 5'-ATG-3', that provides for the translation of a mRNA transcript into a peptide or polypeptide.

Zymogen--an enzymatically inactive precursor of a proteolytic enzyme. Protein C zymogen, as used herein, refers to secreted, inactive forms of protein C, whether one chain or two chain.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 is a restriction site and function map of plasmid pLPC-N. For purposes of the present disclosure, the Figures are not drawn exactly to scale.

FIG. 2 is a restriction site and function map of plasmid pLPC-FN.

FIG. 3 is a restriction site and function map of plasmid pLPC-SC.

FIG. 4 is a restriction site and function map of plasmid pLPC-LIN.

FIG. 5 is a restriction site and function map of plasmid pLPC-FLIN.

FIG. 6 is a restriction site and function map of plasmid pGTC.

FIG. 7 is a restriction site and function map of plasmid pGT-d.

FIG. 8 is a restriction site and function map of plasmid pGT-h.

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides DNA compounds that code for the expression of novel zymogen forms of human protein C. Several methods of producing native human protein C zymogen and nascent human protein C have been described (see Bang et al., U.S. Pat. No. 4,775,624, issued Oct. 4, 1988, the entire teaching of which is herein incorporated by reference). These prior art methods provide for the expression of zymogen forms of human protein C that do not differ in amino acid sequence from the zymogen forms present in human blood. The protein C zymogen produced by these methods must be treated with substances such as α-thrombin, trypsin, or a mixture of thrombin and thrombomodulin (whether in vivo or in vitro) to obtain activated protein C. In addition, a zymogen form of human protein C produced by recombinant DNA technology that is identical in amino acid sequence to zymogen forms of human protein C found naturally in human blood will only be activated in the body by the natural activation pathway primarily involving the thrombin-thrombomodulin complex. Native human protein C zymogen can be activated by thrombin alone; however, the activation requires the absence of Ca²⁺ and such high levels of thrombin and/or protein C zymogen that it is not a significant in vivo pathway to activated protein C.

The present invention provides zymogen forms of human protein C that can be activated in vivo by thrombin alone at a rate of clinical significance. In addition, these zymogen forms are much more susceptible to activation by thrombin/thrombomodulin than native human protein C zymogen. The present invention also provides DNA compounds, recombinant DNA expression vectors, transformed cell lines, and methods for the recombinant expression of these novel zymogen forms of human protein C. The method for producing these zymogen forms of human protein C comprises:

(A) transforming a eukaryotic host cell with a recombinant DNA vector, said vector comprising:

(i) a DNA sequence that encodes an amino acid residue sequence, said amino acid residue sequence comprising, from the amino terminus to the carboxy terminus:

a) a signal peptide and pro-peptide of a γ-carboxylated, secreted protein;

b) the light chain of human protein C;

c) a dipeptide selected from the group consisting of LYS-ARG, ARG-LYS, LYS-LYS, and ARG-ARG; and

d) the amino acid residue sequence: ##STR5## wherein R₁ is selected from the group consisting of ASP and LEU, R₂ is selected from the group consisting of GLN and HIS, R₃ is selected from the group consisting of GLU and LYS, R₄ is selected from the group consisting of ASP and LEU, R₅ is GLN, R₆ is selected from the group consisting of VAL and THR, R₇ is selected from the group consisting of ASP, PHE and TYR, R₈ is PRO, R₉ is ARG, R₁₀ is selected from the group consisting of LEU and THR, R₁₁ is selected from the group consisting of ILE and a deletion, R₁₂ is ASN, and --COOH is the carboxy terminus; and

(ii) a promoter positioned to drive expression of said DNA sequence; and

(B) culturing said host cell transformed in step (A) under conditions that allow for expression of said DNA sequence. This method and compounds useful in the method are more fully described below.
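The choices recited for R₁ through R₁₂ in step (A)(i)(d) can be written down as a small lookup table, which makes it easy to check a candidate segment and to count the combinations the recitation permits. The following Python sketch is illustrative only; the names `ALLOWED`, `DELETION`, `is_allowed`, and `count_combinations` are hypothetical, and `None` is an assumed marker for "a deletion" at R₁₁.

```python
from math import prod

DELETION = None  # assumed marker for "a deletion" at R11

# Allowed choices for R1..R12, transcribed from step (A)(i)(d) above.
ALLOWED = {
    1: {"ASP", "LEU"},
    2: {"GLN", "HIS"},
    3: {"GLU", "LYS"},
    4: {"ASP", "LEU"},
    5: {"GLN"},
    6: {"VAL", "THR"},
    7: {"ASP", "PHE", "TYR"},
    8: {"PRO"},
    9: {"ARG"},
    10: {"LEU", "THR"},
    11: {"ILE", DELETION},
    12: {"ASN"},
}


def is_allowed(segment):
    """Return True if a 12-entry list (R1..R12) uses only recited choices."""
    if len(segment) != len(ALLOWED):
        return False
    return all(res in ALLOWED[i] for i, res in enumerate(segment, start=1))


def count_combinations():
    """Count every combination of the recited R1..R12 choices."""
    return prod(len(choices) for choices in ALLOWED.values())


if __name__ == "__main__":
    candidate = ["ASP", "GLN", "GLU", "ASP", "GLN", "VAL",
                 "ASP", "PRO", "ARG", "LEU", "ILE", "ASN"]
    print(is_allowed(candidate))   # True
    print(count_combinations())    # 384
```

The count is purely combinatorial; it says nothing about which combinations were actually constructed or tested.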

The invention also provides DNA compounds for use in the method of producing these novel zymogen forms of human protein C. These novel compounds all encode a pre-propeptide comprising a signal peptide for directing secretion and a propeptide from a γ-carboxylated (through the action of a vitamin K-dependent carboxylase) protein. Such propeptide sequences are well-known in the art. See, for example, Suttie et al., 1987, Proc. Natl. Acad. Sci. 84:634-637. Preferably, and for ease of construction, both the signal peptide coding sequence and the propeptide coding sequence will be derived from the amino acid residue sequence of the pre-propeptide of a γ-carboxylated protein. Examples of such γ-carboxylated proteins include, but are not limited to, factor VII, factor IX, factor X, prothrombin, protein S, protein Z, and protein C. A DNA sequence encoding the pre-propeptide of human protein C is most preferred for use in the vectors of the invention.

The DNA compounds of the invention further comprise the coding sequence for the light chain of human protein C positioned immediately adjacent to, downstream of, and in translational reading frame with the pre-propeptide coding sequence. The light chain of human protein C contains amino acid residues 43 to 197, inclusive, of nascent protein C, as depicted in the background section above. The amino-terminal portions of the vitamin K-dependent plasma proteins, such as the amino-terminal portion of the light chain of protein C, have calcium-binding sites. The calcium-binding domains of these plasma proteins, such as factor VII, factor IX, factor X, prothrombin, and protein S, may be used in a manner (see European Patent Publication No. 0215548A1, at pages 12 and 13) equivalent to the calcium-binding domain of the light chain of human protein C.

The DNA compounds of the invention further comprise the coding sequence for the dipeptide LYS-ARG (KR) positioned immediately adjacent to, downstream of, and in translational reading frame with the light chain coding sequence. A dibasic dipeptide such as LYS-ARG is positioned in the nascent protein at the carboxyl-terminal side of the light chain. The orientation of the LYS-ARG dipeptide in the expressed protein is irrelevant for purposes of the present invention. Dibasic dipeptides such as LYS-LYS or ARG-ARG are equivalent to the LYS-ARG dipeptide for purposes of the present invention. For purposes of the present invention, however, the dipeptide LYS-ARG, which is the dipeptide in native human protein C, is preferred.

Immediately downstream of the codons for the LYS-ARG dipeptide is the coding sequence of the activation peptide. In the compounds of the invention, changes in the coding sequence for the activation peptide and the first three amino acid residues of the heavy chain (and in the corresponding amino acid sequence) are primarily responsible for the property of increased thrombin-sensitivity of these novel zymogens.

Those skilled in the art will recognize that the zymogen forms of the present invention primarily differ from native zymogen forms of human protein C as described below. In native human protein C the sequence of the activation peptide and the first three amino acid residues of the heavy chain is: ##STR6## in which the numbers refer to the position of the amino acid residues in nascent human protein C. The present invention discloses that changing various residues will result in the corresponding zymogen form having a greater sensitivity to cleavage by thrombin alone, in addition to a greater sensitivity to cleavage by the thrombin-thrombomodulin complex.

The various amino acid deletions and substitutions of the present invention lead to mutant forms in which the resulting zymogen has augmented thrombin sensitivity. The phrase "resulting zymogen" is used to indicate that although substitutions are described with reference to amino acid positions in nascent human protein C, nascent human protein C must first be secreted (resulting in removal of amino acid residues 1 through 42) to obtain a zymogen form. Replacement of the aspartic acid residue at position 214 in nascent human protein C (the third residue following the activation peptide) with an asparagine residue results in a novel zymogen of the present invention. The deletion (rather than substitution) of an amino acid residue also results in novel zymogen forms of protein C. For ease of understanding and numbering, a deletion is represented by a zero (0). When an amino acid is deleted, the amino acids on either side of the deleted residue are linked to form the contiguous zymogen chain. Table I displays the various novel zymogen forms of protein C of this invention.
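The substitution and deletion convention described above can be sketched in Python as follows. The native residues at positions 203 through 214 are taken from the native protein C row of Table I; the function name `apply_changes` and the example change sets are hypothetical illustrations and are not asserted to correspond to any particular named zymogen form.

```python
# Native residues at positions 203-214 of nascent human protein C,
# taken from the "Native Protein C" row of Table I.
NATIVE_SEGMENT = {
    203: "ASP", 204: "GLN", 205: "GLU", 206: "ASP",
    207: "GLN", 208: "VAL", 209: "ASP", 210: "PRO",
    211: "ARG", 212: "LEU", 213: "ILE", 214: "ASP",
}


def apply_changes(native, changes):
    """Apply substitutions and deletions to a numbered segment.

    `changes` maps a position to a replacement residue, or to 0 for a
    deletion. Deleted positions are skipped, so the residues on either
    side become adjacent in the returned chain.
    """
    chain = []
    for position in sorted(native):
        residue = changes.get(position, native[position])
        if residue == 0:
            continue  # deletion: neighbors are linked directly
        chain.append(residue)
    return chain


if __name__ == "__main__":
    # Aspartic acid at 214 replaced by asparagine.
    print(apply_changes(NATIVE_SEGMENT, {214: "ASN"}))
    # Hypothetical example: the same replacement plus deletion of 213.
    print(apply_changes(NATIVE_SEGMENT, {213: 0, 214: "ASN"}))
```

Numbering stays tied to nascent human protein C even after a deletion, which is why the change set is keyed by the original positions and not by index in the shortened chain.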

TABLE I
__________________________________________________________________________________________________________________________________________
Zymogen Form*     R₁        R₂        R₃        R₄        R₅        R₆        R₇        R₈        R₉        R₁₀       R₁₁       R₁₂
                  203       204       205       206       207       208       209       210       211       212       213       214
__________________________________________________________________________________________________________________________________________
Native Protein C  ASP       GLN       GLU       ASP       GLN       VAL       ASP       PRO       ARG       LEU       ILE       ASP
N                 ASP       GLN       GLU       ASP       GLN       VAL       ASP       PRO       ARG       LEU       0         ##STR7##
FN                ASP       GLN       GLU       ASP       GLN       VAL       ##STR8##  PRO       ARG       LEU       0         ##STR9##
SC                ##STR10## ##STR11## ##STR12## ##STR13## GLN       ##STR14## ##STR15## PRO       ARG       ##STR16## 0         ##STR17##
LIN               ASP       GLN       GLU       ASP       GLN       VAL       ASP       PRO       ARG       LEU       ILE       ##STR18##
FLIN              ASP       GLN       GLU       ASP       GLN       VAL       ##STR19## PRO       ARG       LEU
                                       ILE                                                                                ##STR20##                       __________________________________________________________________________      *Substitutions and deletions are underlined                              

Thus, the preferred novel zymogen forms of human protein C of the present invention result from secretion and processing of nascent human protein C molecules with the amino acid residue sequence depicted below: ##STR21## wherein R₁ is selected from the group consisting of ASP and LEU, R₂ is selected from the group consisting of GLN and HIS, R₃ is selected from the group consisting of GLU and LYS, R₄ is selected from the group consisting of ASP and LEU, R₅ is GLN, R₆ is selected from the group consisting of VAL and THR, R₇ is selected from the group consisting of ASP, PHE and TYR, R₈ is PRO, R₉ is ARG, R₁₀ is selected from the group consisting of LEU and THR, R₁₁ is selected from the group consisting of ILE and a deletion, R₁₂ is ASN and --COOH is the carboxy terminus.
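The residue choices recited for R₁ through R₁₂ define a finite family of activation-region sequences. The following Python sketch, offered only as an illustration and not as part of the disclosed method, enumerates that family. The names `ALLOWED` and `enumerate_variants` and the `"DEL"` marker for the deletion at R₁₁ are assumptions introduced here for the example.

```python
from itertools import product

# Allowed residues at positions R1..R12, transcribed from the paragraph above.
# "DEL" is an illustrative marker (an assumption of this sketch) for the
# deletion permitted at R11.
ALLOWED = [
    ("ASP", "LEU"),         # R1
    ("GLN", "HIS"),         # R2
    ("GLU", "LYS"),         # R3
    ("ASP", "LEU"),         # R4
    ("GLN",),               # R5
    ("VAL", "THR"),         # R6
    ("ASP", "PHE", "TYR"),  # R7
    ("PRO",),               # R8
    ("ARG",),               # R9
    ("LEU", "THR"),         # R10
    ("ILE", "DEL"),         # R11
    ("ASN",),               # R12
]


def enumerate_variants(allowed=ALLOWED):
    """Yield each permitted R1..R12 combination as a tuple of residues.

    A deletion at R11 is represented by omitting that position, so such
    variants are eleven residues long rather than twelve.
    """
    for combo in product(*allowed):
        yield tuple(residue for residue in combo if residue != "DEL")


variants = list(enumerate_variants())
print(len(variants))  # number of distinct sequences in the family
```

Multiplying the number of choices at each position (2·2·2·2·1·2·3·1·1·2·2·1) gives 384 distinct sequences, which is the count the sketch prints.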

Those skilled in the art will recognize that, due to the degeneracy of the genetic code, a variety of DNA compounds can encode the polypeptide depicted above. Consequently, the constructions described below and in the accompanying Examples for the preferred DNA compounds, vectors, and transformants of the invention are merely illustrative and do not limit the scope of the invention.
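The extent of this degeneracy can be made concrete by counting codons. The sketch below, given for illustration only, builds the standard genetic code and multiplies the number of synonymous codons at each residue of a short peptide. The function name `count_encodings` and the example peptide (one combination permitted by the residue definitions above, with the deletion at R₁₁) are choices made here for the example and are not taken from the Examples.

```python
from collections import Counter
from itertools import product
from math import prod

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Number of codons that encode each amino acid (one-letter code).
CODONS_PER_AA = Counter(CODON_TABLE.values())

# Three-letter to one-letter codes for the residues named in this section.
THREE_TO_ONE = {
    "ASP": "D", "GLN": "Q", "GLU": "E", "VAL": "V", "PRO": "P",
    "ARG": "R", "LEU": "L", "ILE": "I", "ASN": "N", "HIS": "H",
    "LYS": "K", "THR": "T", "PHE": "F", "TYR": "Y",
}


def count_encodings(peptide):
    """Return the number of distinct DNA sequences encoding `peptide`.

    `peptide` is a sequence of three-letter residue codes.
    """
    return prod(CODONS_PER_AA[THREE_TO_ONE[res]] for res in peptide)


# Illustrative eleven-residue peptide (R11 deleted, R12 = ASN).
example = ["ASP", "GLN", "GLU", "ASP", "GLN", "VAL",
           "ASP", "PRO", "ARG", "LEU", "ASN"]
print(count_encodings(example))
```

Even for this eleven-residue stretch the count runs to tens of thousands of distinct coding sequences, which is why the particular DNA constructions are described as illustrative.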

All of the DNA compounds of the present invention were prepared by site-directed mutagenesis of the human protein C gene. The mutagenized zymogen-encoding molecules were then inserted into eukaryotic expression vectors such that expression of the zymogen genes was driven by the major late promoter of adenovirus-2. The vectors also comprise the P2 enhancer element of the BK virus positioned to enhance expression from the promoter. The vectors were transformed into Escherichia coli K12 AG1 cells and deposited and made part of the permanent stock culture collection of the Northern Regional Research Laboratories in Peoria, Ill. 61604. The specific cultures, deposit dates and accession numbers are found in Table II.

TABLE II
______________________________________________________________
Culture                      Accession Number   Date of Deposit
______________________________________________________________
E. coli K12 AG1/pLPC-N       NRRL B-18612       01/09/90
E. coli K12 AG1/pLPC-FN      NRRL B-18613       01/09/90
E. coli K12 AG1/pLPC-SC      NRRL B-18614       01/13/90
E. coli K12 AG1/pLPC-LIN     NRRL B-18615       01/13/90
E. coli K12 AG1/pLPC-FLIN    NRRL B-18616       01/13/90
______________________________________________________________

The cultures are obtained and the plasmids are isolated using conventional techniques, and then may be directly transfected into eukaryotic host cells for the production of the zymogen forms of human protein C. It is preferable to transform the plasmids into host cells which express the adenovirus E1A immediate-early gene product, in that the BK enhancer found on the vectors functions to enhance expression most efficiently in the presence of E1A. Skilled artisans will realize that a number of host cells express, or can be made to express, an immediate-early gene product of a large DNA virus. Preferred cell lines are the human kidney 293 cell line (available from the American Type Culture Collection under accession number ATCC CRL 1573) and the Syrian hamster cell line AV12 (ATCC CRL 9595). The embryonic human kidney cell line 293 is most preferred.

To obtain even higher levels of expression, the genes encoding the various zymogen forms of protein C can be cut out of the deposited vectors and ligated into a vector which contains the GBMT transcription control unit. Specifically, plasmid pGTC, which contains the native human protein C gene driven by the GBMT unit, can be obtained (in E. coli K12 AG1) from the NRRL under the accession number NRRL B-18593. The native gene is removed via digestion of the plasmid grown in a dam⁻ strain of E. coli with restriction enzyme BclI. The novel zymogen genes can each be removed from their respective plasmids via BclI digestion. The vector backbone is purified and dephosphorylated, then any one of the novel zymogen genes of the present invention is ligated into the BclI restriction site. The plasmids comprising the novel zymogen genes positioned for expression behind the GBMT transcription unit are then transformed into 293 cells, the cells are cultured, and the novel zymogens are purified from the culture by techniques which are well known in the art. One method for the purification of human protein C from cell culture is disclosed in Yan, U.S. Pat. No. 4,981,952, issued Jan. 1, 1991, the entire teaching of which is herein incorporated by reference. The GBMT transcription unit is described in more detail in Grinnell et al., U.S. patent application Ser. No. 07/484,082, filed herewith on even date, the entire teaching of which is herein incorporated by reference.
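The use of a dam⁻ strain follows from the BclI recognition sequence, TGATCA, which contains the GATC target of dam methylation; BclI does not cut a methylated site. The sketch below, given for illustration only, locates BclI sites in a sequence and computes the fragment lengths expected from complete digestion of a circular plasmid. The toy sequence is invented for the example and is not the sequence of pGTC or of any deposited plasmid, and the function names are likewise assumptions of this sketch.

```python
BCLI_SITE = "TGATCA"  # BclI cuts after the first T: T^GATCA (palindromic)
DAM_SITE = "GATC"     # target of dam methylation, contained in every BclI site


def find_sites(seq, site=BCLI_SITE):
    """Return 0-based start positions of `site` in `seq` (overlaps allowed)."""
    seq = seq.upper()
    hits = []
    start = seq.find(site)
    while start != -1:
        hits.append(start)
        start = seq.find(site, start + 1)
    return hits


def circular_fragment_lengths(seq, site=BCLI_SITE, cut_offset=1):
    """Fragment lengths after complete digestion of a circular molecule.

    Simplification: a site that spans the arbitrary origin of the sequence
    string is not detected.
    """
    n = len(seq)
    cuts = [pos + cut_offset for pos in find_sites(seq, site)]
    if not cuts:
        return [n]  # uncut circle
    return [(cuts[(i + 1) % len(cuts)] - c) % n or n for i, c in enumerate(cuts)]


# Hypothetical 30-bp "plasmid" with an insert flanked by two BclI sites.
toy_plasmid = "AAAA" + BCLI_SITE + "CCCCCCCCCC" + BCLI_SITE + "GGGG"
print(find_sites(toy_plasmid))
print(circular_fragment_lengths(toy_plasmid))
```

With two sites, digestion releases two fragments, corresponding in the procedure above to the insert and the vector backbone.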

The compounds of the invention also include the zymogen forms generated upon secretion of the nascent proteins of the invention. The activated protein C derivatives produced upon activation of the zymogen forms of protein C are also compounds of the invention. Thus, the compounds of the invention include DNA coding sequences, expression vectors that drive expression of those sequences, nascent proteins produced upon translation of mRNA transcripts generated from those coding sequences, zymogens produced upon secretion of those nascent proteins, and activated derivatives of certain of the zymogens.

The DNA compounds of the invention can also be synthesized chemically, or by combining restriction fragments, or by a combination of techniques known in the art. DNA synthesizing machines are also available and can be used to construct the compounds of the invention.

The illustrative vectors of the invention comprise the BK enhancer positioned to stimulate transcription by the adenovirus major late promoter of the coding sequence of the invention. Those skilled in the art recognize that a great number of eukaryotic promoters, enhancers, and expression vectors are known in the art and can be used in the method of the present invention. Those skilled in the art also recognize that a eukaryotic expression vector can function without an enhancer element. The key aspect of the present invention does not reside in the particular enhancer, if any, or promoter, used to drive expression of the protein C zymogen but rather resides in the novel coding sequence and corresponding proteins produced from that sequence.

However, choice of vector elements, such as promoters, enhancers, and selectable markers, can have great impact on the ultimate levels of protein produced by a eukaryotic host cell. U.S. patent application Ser. No. 849,999, filed Apr. 9, 1986, incorporated herein by reference, discloses a number of expression vectors for native zymogen protein C that utilize the BK enhancer to stimulate a eukaryotic promoter positioned to drive expression of nascent human protein C. These vectors drive especially high expression levels when transformed into eukaryotic cells that also express an immediate-early gene product of a large DNA virus, such as the E1A gene product of adenovirus. As is evident from the illustrative vectors pGT-N, pGT-FN, pGT-SC, pGT-LIN and pGT-FLIN disclosed herein, the GBMT-E1A gene product expression method of U.S. patent application Ser. No. 07/484,082 is especially preferred for use with the vectors of the present invention.

The present invention is not limited to use in a particular eukaryotic host cell. A variety of eukaryotic host cells are available from depositories such as the American Type Culture Collection (ATCC) Rockville, Md. 20852, and are suitable for use with the vectors of the invention. The choice of a particular host cell depends to some extent on the particular expression vector used to drive expression of the protein C-encoding DNA compounds of the invention. Because nascent human protein C and the nascent human protein C derivatives of the invention undergo substantial post-translational modification, however, some host cells are more preferred for use with the vectors of the invention. U.S. patent application Ser. No. 849,999 and Grinnell et al., 1987, Bio/Technology 5:1189 disclose that adenovirus-transformed, human embryonic kidney cells are especially preferred for use in the recombinant production of γ-carboxylated proteins such as human protein C. One such adenovirus-transformed, human embryonic kidney cell line is the 293 cell line, available from the ATCC under the accession number ATCC CRL 1573. The 293 cell line is also preferred for use with the vectors of the present invention.

However, the advantages of producing a γ-carboxylated protein, such as human protein C zymogen, in an adenovirus-transformed cell line are not limited to adenovirus-transformed human embryonic kidney cells. In fact, adenovirus-transformed cells in general are exceptional hosts for the production of γ-carboxylated human protein C. One especially preferred cell line of this type is the AV12-664 (hereinafter "AV12") cell line, available from the ATCC under the accession number ATCC CRL 9595. The AV12 cell line was constructed by injecting a Syrian hamster in the scruff of the neck with human adenovirus 12 and isolating cells from the resulting tumor. Example 3, below, describes the transformation of both the 293 and AV12 cell lines with illustrative vector pGT-N.

The vectors of the invention can be transformed into and expressed in a variety of eukaryotic, especially mammalian, host cells. Vectors of the invention that possess no selectable marker with which to isolate and identify stable eukaryotic transformants are useful not only for purposes of transient assay but also for purposes of cotransformation, a procedure disclosed in U.S. Pat. No. 4,399,216, issued Aug. 26, 1983, and incorporated herein by reference. The vectors of the invention can also comprise sequences that allow for replication in E. coli, as it is usually more efficient to prepare plasmid DNA in E. coli than in other host organisms.

Expression of the coding sequences for human protein C contained on the vectors of the invention occurs in those host cells in which the particular promoter associated with the structural gene functions. Exemplary host cells suitable for use in the invention are listed in Table III, along with appropriate comments.

TABLE III
______________________________________________________________________________________________
Host Cell            Origin                           Source            Comments
______________________________________________________________________________________________
HepG-2               Human Liver Hepatoblastoma       *ATCC #HB 8065    U.S. Pat. No. 4,393,133 describes the use of this cell line.
CV-1                 African Green Monkey Kidney      ATCC #CCL 70
LLC-MK₂ original     Rhesus Monkey Kidney             ATCC #CCL 7
LLC-MK₂ derivative   Rhesus Monkey Kidney             ATCC #CCL 7.1     Grows faster than ATCC #CCL 7
3T3                  Mouse Embryo Fibroblasts         ATCC #CCL 92
CHO-K1               Chinese Hamster Ovary            ATCC #CCL 61      Proline-requiring. Derivatives of CHO-K1, such as the dhfr- derivative DXB11, can be generated from this host.
HeLa                 Human Cervix Epitheloid          ATCC #CCL 2
RPMI8226             Human Myeloma                    ATCC #CCL 155     IgG lambda-type light chain secreting
H4IIEC3              Rat Hepatoma                     ATCC #CRL 1600    Derivatives, such as 8-azaguanine-resistant FAZA host cells, can be generated from this host.
C127I                Mouse Fibroblast                 ATCC #CRL 1616
HS-Sultan            Human Plasma Cell Plasmocytoma   ATCC #CRL 1484
BHK-21               Baby Hamster Kidney              ATCC #CCL 10
______________________________________________________________________________________________
*American Type Culture Collection, 12301 Parklawn Drive, Rockville, Maryland 20852-1776

As indicated by Table III, many mammalian host cells possess the necessary cellular machinery for the recognition and proper processing of the signal peptide on the nascent proteins of the invention and provide the post-translational modifications, such as glycosylation, γ-carboxylation, and β-hydroxylation, as are observed in human protein C present in blood plasma. However, as indicated above, optimal post-translational processing of human protein C occurs in adenovirus-transformed cells. A wide variety of vectors, discussed below, exists for the transformation of such eukaryotic host cells, but the specific vectors exemplified below are in no way intended to limit the scope of the present invention.

The pSV2-type vectors comprise segments of the SV40 genome that constitute a defined eukaryotic transcription unit: promoter (ep), intervening sequence (IVS), and polyadenylation (pA) site. In the absence of SV40 T-antigen, the plasmid pSV2-type vectors transform mammalian and other eukaryotic host cells by integrating into the host cell chromosomal DNA. A variety of plasmid pSV2-type vectors have been constructed (see Eukaryotic Viral Vectors, edited by Gluzman, published by Cold Spring Harbor Laboratories, Cold Spring Harbor, N.Y., 1982), such as plasmids pSV2-gpt, pSV2-neo, pSV2-dhfr, pSV2-hyg, and pSV2-β-globin, in which the SV40 promoter drives transcription of an inserted gene. These vectors are suitable for use with the coding sequences of the invention and are available either from the American Type Culture Collection (ATCC) in Rockville, Md. or from the Northern Regional Research Laboratory (NRRL) in Peoria, Ill.

Plasmid pSV2-dhfr (ATCC 37146) comprises a murine dihydrofolate reductase (dhfr) gene under the control of the SV40 early promoter. Under the appropriate conditions, the dhfr gene is known to be amplified, or copied, in the host chromosome. This amplification, described in a review article by Schimke, 1984, Cell 37:705-713, can involve DNA sequences closely contiguous with the dhfr gene, such as a nascent human protein C-encoding sequence of the invention, and thus can be used to increase production of the protein C zymogens of the invention.

Plasmids which were constructed for expression of the nascent protein C and protein C zymogens of the invention in mammalian and other eukaryotic host cells can utilize a wide variety of promoters. The present invention is in no way limited to the use of the particular eukaryotic promoters exemplified herein. Promoters such as the SV40 late promoter or the eukaryotic promoters disclosed in Bucher et al., 1986, Nuc. Acids Res. 14(24):1009, or promoters from eukaryotic genes, such as, for example, the estrogen-inducible chicken ovalbumin gene, the interferon genes, the glucocorticoid-inducible tyrosine aminotransferase gene, the thymidine kinase gene, and the major early and late adenovirus genes, can be readily isolated and modified for use on recombinant DNA expression vectors designed to produce human protein C zymogen in eukaryotic host cells. Eukaryotic promoters can also be used in tandem to drive expression of a coding sequence of the invention. Furthermore, a large number of retroviruses are known that infect a wide range of eukaryotic host cells. The long terminal repeats in the retrovirus DNA often encode promoter activity and thus can be used to drive expression of the coding sequences of the invention.

Plasmid pRSVcat (ATCC 37152) comprises portions of the long terminal repeat of the Rous Sarcoma virus (RSV), a virus known to infect chicken and other host cells. The RSV long terminal repeat sequences can be isolated on an ~0.76 kb NdeI-HindIII restriction fragment of plasmid pRSVcat. The promoter in the RSV long terminal repeat (Gorman et al., 1982, P.N.A.S. 79:6777) is suitable for use in vectors of the invention. Plasmid pMSVi (NRRL B-15929) comprises the long terminal repeats of the Murine Sarcoma virus (MSV), a virus known to infect mouse and other host cells. These repeat sequences are suitable for use as a promoter in the vectors of the invention. The mouse metallothionein (MMT) promoter has also been well characterized for use in eukaryotic host cells and is suitable for use in the vectors of the invention. The MMT promoter is present in the 15 kb plasmid pdBPV-MMTneo (ATCC 37224), which can serve as the starting material for the construction of other plasmids of the present invention.

Many modifications and variations of the present illustrative DNA sequences and plasmids are possible. For example, the degeneracy of the genetic code allows for the substitution of nucleotides throughout polypeptide coding regions, as well as in the translational stop signal, without alteration of the encoded polypeptide sequence. Such substitutable sequences can be deduced from the known amino acid or DNA sequence of human protein C and can be constructed by following conventional synthetic or site-specific mutagenesis procedures. Synthetic methods can be carried out in substantial accordance with the procedures of Itakura et al., 1977, Science 198:1056 and Crea et al., 1978, Proc. Nat. Acad. Sci. USA 75:5765. Therefore, the present invention is in no way limited to the DNA sequences and plasmids specifically exemplified.

After transformation of a vector of the invention into a eukaryotic host cell, one can select transformants on the basis of a selectable phenotype. This selectable phenotype can be conferred either by a selectable marker present on the expression vector or present on another vector cotransformed with the expression vector into the host cell. Once transformants are selected, it is desirable to identify which transformants are expressing the highest levels of the desired protein encoded on the expression vector. Such identification is especially important after a cotransformation procedure, which generates a number of transformants that contain only the plasmid containing the selectable marker and so do not contain the expression vector. In Examples 3 and 4, below, a protocol not only for identifying cells that express and secrete a desired protein but also for quantifying, relative to the other cells examined using the method, the amount of protein secreted is described. The protocol also allows for the isolation of viable cells secreting the highest levels of a desired protein.

Methods for the activation of zymogen forms of human protein C are old and well known to the skilled artisan. Protein C may be activated by thrombin alone, by a thrombin/thrombomodulin complex, by Russell's viper venom, or by a wide variety of other means. To compare the activation rates of the zymogens of the present invention, thrombin alone was used, either in the presence of 3 mM CaCl₂ or in the presence of 8 mM EDTA. Thrombin activation and protein C activity assays (amidolytic and anticoagulant) were performed as per Grinnell et al., 1987, Bio/Technology 5:1189-1192, the teaching of which is herein incorporated by reference. Other methods for activating protein C using immobilized thrombin are disclosed in Yan, U.S. patent application Ser. No. 07/403,516, filed Sep. 5, 1989, the teaching of which is herein incorporated by reference. The relative rates of activation are disclosed in Table IV.

TABLE IV
______________________________________________________
                Thrombin            Thrombin
Zymogen Form    (3 mM CaCl₂)        (8 mM EDTA)
______________________________________________________
Wild Type       1                   1
F167            20                  2
LIN             3                   1.3
FLIN            30                  3
FN              ~450                5-6
SC              ~260                5-6
______________________________________________________

Zymogen F167 contains the wild type activation peptide except that the amino acid at position 209 was changed from Aspartic Acid to Phenylalanine. Zymogen F167 is disclosed in U.S. patent application Ser. No. 07/138,009, filed Dec. 28, 1987, the teaching of which is herein incorporated by reference. The functional anticoagulant activities of the F167, LIN and FLIN zymogen mutants were 100 to 109% of the wild type control. The activities of the FN and SC mutants were less than 10% of control due to very low amidolytic activities.
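One way to read Table IV is to ask how much of each zymogen's improvement is specific to the calcium-containing condition. The sketch below, given for illustration only, divides the relative rate in 3 mM CaCl₂ by the relative rate in 8 mM EDTA. Two assumptions are made for the arithmetic and are not part of the reported data: the "5-6" entries are replaced by their midpoint, 5.5, and the approximate entries (~450, ~260) are treated as exact.

```python
# Relative activation rates transcribed from Table IV
# (wild type = 1 in each column): (3 mM CaCl2, 8 mM EDTA).
# ASSUMPTIONS: "5-6" is represented by the midpoint 5.5, and the
# approximate values ~450 and ~260 are used as if exact.
TABLE_IV = {
    "Wild Type": (1.0, 1.0),
    "F167": (20.0, 2.0),
    "LIN": (3.0, 1.3),
    "FLIN": (30.0, 3.0),
    "FN": (450.0, 5.5),
    "SC": (260.0, 5.5),
}


def calcium_selectivity(table=TABLE_IV):
    """Return the CaCl2-to-EDTA ratio of relative activation rates per zymogen."""
    return {name: ca / edta for name, (ca, edta) in table.items()}


for name, ratio in calcium_selectivity().items():
    print(f"{name:10s} {ratio:6.1f}")
```

Because each column is already normalized to wild type, a ratio above 1 indicates that the gain over wild type is larger in the presence of Ca²⁺ than in its absence. On these figures the FN and SC forms show the largest calcium-specific gain, in keeping with the stated object of activation by thrombin in the presence of physiological Ca²⁺.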

Activated protein C has substantial antithrombotic properties in the prevention of extension of intravenous thrombi, in the prevention of formation of arterial thrombi, and in the prevention of death and organ failure from Gram negative sepsis, endotoxemia, and disseminated intravascular coagulation. In animal experiments, infusion of native zymogen protein C was without effect in the treatment of Gram negative septicemia with shock and disseminated intravascular coagulation (DIC). These negative results indicated that in this form of widespread microvascular thrombosis involving massive thrombin generation, insufficient thrombomodulin was present to complex with thrombin and activate the infused zymogen.

The major disadvantage of activated protein C, as with other activated serine proteases, is its short half-life (T1/2) as compared to the zymogen precursor. The T1/2 in dogs was established to be 11 minutes and the T1/2 in monkeys to be 22 to 26 minutes. In comparison, the T1/2 of native protein C zymogen in man is estimated at 6 hours. The reasons for the shorter biological half-lives of activated serine proteases, including activated protein C, as compared to their zymogens, are complex and involve both cellular and humoral mechanisms. Activated serine proteases also form complexes with serine protease inhibitors normally present in plasma. Activated protein C (APC) complexes with a newly described APC inhibitor as well as with alpha-2 macroglobulin. The inactive zymogens, including the protein C zymogens of the invention, do not react with serine protease inhibitors.
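The practical consequence of these half-lives can be illustrated with a simple decay calculation. The sketch below assumes single-compartment, first-order elimination, which is an assumption of the example and is not stated in the data above. It also compares figures obtained in different species, so the output indicates orders of magnitude only.

```python
def fraction_remaining(elapsed_min, half_life_min):
    """Fraction of the initial amount left after `elapsed_min` minutes.

    ASSUMPTION: first-order (exponential) elimination with a single,
    constant half-life.
    """
    if half_life_min <= 0:
        raise ValueError("half-life must be positive")
    return 0.5 ** (elapsed_min / half_life_min)


# Half-lives quoted in the text, converted to minutes.
HALF_LIVES_MIN = {
    "activated protein C (dog)": 11.0,
    "activated protein C (monkey, lower bound)": 22.0,
    "activated protein C (monkey, upper bound)": 26.0,
    "native protein C zymogen (man, estimate)": 6 * 60.0,
}

# Fraction of a single dose remaining one hour after administration.
for label, t_half in HALF_LIVES_MIN.items():
    print(f"{label}: {fraction_remaining(60, t_half):.3f}")
```

Under these assumptions only a few percent of a dose of the activated enzyme remains after one hour at an 11 minute half-life, whereas most of a zymogen dose with a 6 hour half-life is still present.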

The advantage of the protein C zymogens of this invention is that they are better activated by thrombin than native protein C zymogen, because thrombin has a dramatically reduced requirement for complexing with thrombomodulin to activate these zymogens in the presence of Ca²⁺. It follows that these protein C zymogens, when administered, can be activated at sites of intravascular thrombin generation, i.e., at any site where an intravascular thrombus is under development. Thus, these recombinant protein C zymogens can be used as prodrugs and will become activated at the sites of thrombin generation. Because these thrombin-sensitive zymogens can be administered in the zymogen form, they will not complex with protein C inhibitors and may exhibit a biological half-life equal to that of native protein C zymogen.

The recombinant protein C zymogens of the invention are useful in the prevention and treatment of a wide variety of acquired disease states involving intravascular coagulation, including deep vein thrombosis, pulmonary embolism, peripheral arterial thrombosis, emboli originating from the heart or peripheral arteries, acute myocardial infarction, thrombotic strokes, and disseminated intravascular coagulation. These protein C derivatives can also be used efficiently in the treatment of the significant numbers of patients with heterozygous protein C deficiencies presenting recurrent deep vein thrombosis and in the case of the homozygous protein C deficient patients with purpura fulminans.

Experimental and clinical data suggest that conventional anticoagulants, particularly warfarin, are useful in the treatment of invasive cancers and act to prevent or reduce the distant metastatic lesions of these malignancies. In addition, it is well established that inflammatory stimuli, such as endotoxins, tumor necrosis factor, and interleukin 1, deplete thrombomodulin from the surface of endothelial cells, which is thought to trigger microvascular and macrovascular thrombosis. The recombinant protein C zymogens of the invention represent an attractive alternative to conventional anticoagulants in these clinical situations.

An attractive therapeutic indication for activated protein C is in the prevention of deep vein thrombosis and pulmonary embolism, currently treated with low doses of heparin. The added advantage of these zymogens is that they may be given as bolus injections rather than constant IV infusions. Activated protein C must be given by continuous IV infusion because of the short T1/2 of that protein.

There is a lower likelihood of bleeding complications from infusions of the protein C zymogens of the invention. Thus, these zymogens can replace heparin intra- and post-surgically in conjunction with thrombectomies or embolectomies, surgical procedures which are often necessary to save ischemic limbs from amputation in the setting of an acute arterial obstruction. Because of their long T1/2, as compared to activated protein C, and their relative ease of administration, these zymogens are better suited than activated protein C for the treatment of arterial emboli originating from the heart. The long term administration of these zymogens in doses comparable to those used for the treatment of established deep vein thrombosis-pulmonary embolism has substantial utility in the prevention of cardiogenic emboli.

Similarly, the protein C zymogens of the invention can be used for the treatment of emboli originating from thrombi in peripheral arteries, most notably the carotid arteries, which are not treated or prevented satisfactorily with currently used regimens, which include drugs capable of suppressing platelet function, oral anticoagulants, or combinations thereof. These zymogens can be administered long term in the same manner as outlined for cardiogenic emboli and have major potential in the prevention of emboli originating from carotid artery thrombi and resulting in embolic strokes.

The protein C zymogens of the invention are also useful in thrombotic strokes. Today, strokes are not usually treated with conventional anticoagulants. Treatment of strokes with either heparin or oral anticoagulants, although occasionally beneficial, carries a high risk for bleeding into the infarcted brain area, thereby aggravating the neurological deficit accompanying the stroke. Because of their low potential for causing bleeding complications and their selectivity, the zymogens of the invention can be given to stroke victims and can be beneficial in preventing the local extension of the occluding arterial thrombus, thereby reducing the neurological deficit resulting from the stroke.

The zymogens of the invention will also be useful in treating acute myocardial infarction, because of their pro-fibrinolytic properties, once activated. These zymogens can be given with tissue plasminogen activator during the acute phases of the myocardial infarction. After the occluding coronary thrombus is dissolved, the zymogens can be given for additional days to prevent acute myocardial reinfarction.

Activated protein C is useful in the treatment of disseminated intravascular coagulation (DIC). Heparin and the oral anticoagulants have been given to patients with DIC in extensive clinical trials, but the results have been disappointing. In DIC, activated protein C, as well as the zymogens of the present invention, has a distinct advantage over conventional anticoagulants. As mentioned above, it has been established in animal experiments that the native protein C zymogen is ineffective in the prevention of death and organ damage from Gram negative septicemia and DIC. In contrast, the protein C zymogens of the invention, being highly susceptible to activation by thrombin, will be effective treatment for DIC.

Conventional anticoagulant drugs, particularly warfarin, are useful in the treatment of invasive malignant tumors. Many tumor cells produce substances which trigger the activation of the coagulation system resulting in local fibrin deposits. These fibrin deposits function as "nests" in which cancer cells can divide to form metastatic lesions. However, it is not possible to administer warfarin or other conventional anticoagulants in combination with the more intensive and effective forms of chemotherapy, because such therapy produces a sharp drop in the platelet count, and thrombocytopenia combined with warfarin therapy puts the patient at an unacceptably high risk for serious bleeding complications. The protein C derivatives of the invention, like activated protein C, being more selective than conventional anticoagulants and having a far higher therapeutic index than either heparin or the oral anticoagulants, can be given relatively safely to the thrombocytopenic patient, thus making possible the treatment of patients with invasive cancers with effective and intensive chemotherapy in combination with a protein C zymogen of the invention.

The zymogens, and activated counterparts, of the present invention can be formulated according to known methods to prepare pharmaceutically useful compositions, whereby a human protein C zymogen or activated protein C of the invention is combined in admixture with a pharmaceutically acceptable carrier vehicle. Suitable carrier vehicles and their formulation, inclusive of other human proteins, e.g., human serum albumin, are described, for example, in Remington's Pharmaceutical Sciences 16th ed., 1980, Mack Publishing Co., edited by Osol et al., which is hereby incorporated by reference. Such compositions will contain an effective amount of a protein C zymogen, or activated counterpart, together with a suitable amount of carrier vehicle to prepare pharmaceutically acceptable compositions suitable for effective administration to the host. The protein C composition can be administered parenterally, or by other methods that ensure its delivery to the bloodstream in an effective form.

It should also be noted that the zymogens of the present invention can be used to prepare activated protein C in vitro. Although recombinant methods for producing activated protein C directly in eukaryotic cells are known, these methods require that the activated protein C remain in the culture media for long periods of time. Because activated protein C is relatively unstable, these direct expression methods can yield low amounts of activated protein C. In contrast, the zymogens of the invention can be activated by thrombin alone, even in the presence of Ca²⁺, and thus offer significant advantages over known methods for producing activated protein C.

The following Examples illustrate the methods and describe the construction protocols for representative compounds, vectors and transformants of the invention without limiting the same thereto.

EXAMPLE 1 Isolation of Plasmid pLPC-FLIN

Lyophils of E. coli K12 AG1/pLPC-FLIN are obtained from the Northern Regional Research Laboratory, Peoria, Ill. 61604, under the accession number NRRL B-18616. The lyophils are decanted into tubes containing 10 ml LB medium (10 g Bacto-tryptone, 5 g Bacto-yeast extract, and 10 g NaCl per liter; pH is adjusted to 7.5) and incubated two hours at 32° C., at which time the cultures are made 50 μg/ml in ampicillin and then incubated at 37° C. overnight.

A small portion of the overnight culture is placed on LB-agar (LB medium with 15 g/l Bacto-agar) plates containing 50 μg/ml ampicillin in a manner so as to obtain a single colony isolate of E. coli K12 AG1/pLPC-FLIN. The single colony obtained was inoculated into 10 ml of LB medium containing 50 μg/ml ampicillin and incubated overnight at 37° C. with vigorous shaking. The 10 ml overnight culture was inoculated into 500 ml LB medium containing 50 μg/ml ampicillin and incubated at 37° C. with vigorous shaking until the culture reached stationary phase.
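By way of illustration only, the LB and LB-agar recipes given above can be scaled to the culture volumes used in this Example with a short calculation. The sketch below is not part of the original procedure; the function names `lb_recipe` and `ampicillin_mg` are hypothetical, and only the per-liter amounts and the 50 μg/ml ampicillin level are taken from the text.

```python
# Grams per liter of LB medium, as stated in Example 1.
LB_PER_LITER_G = {
    "Bacto-tryptone": 10.0,
    "Bacto-yeast extract": 5.0,
    "NaCl": 10.0,
}
# Bacto-agar added per liter to make LB-agar plates, as stated in Example 1.
AGAR_PER_LITER_G = 15.0


def lb_recipe(volume_ml, agar=False):
    """Return grams of each component needed for volume_ml of LB or LB-agar."""
    recipe = {name: grams * volume_ml / 1000.0
              for name, grams in LB_PER_LITER_G.items()}
    if agar:
        recipe["Bacto-agar"] = AGAR_PER_LITER_G * volume_ml / 1000.0
    return recipe


def ampicillin_mg(volume_ml, ug_per_ml=50.0):
    """Return milligrams of ampicillin for the given final concentration."""
    return volume_ml * ug_per_ml / 1000.0


if __name__ == "__main__":
    # The 500 ml culture of Example 1.
    print(lb_recipe(500))
    print(ampicillin_mg(500), "mg ampicillin")
```

For the 500 ml culture, this gives 5 g Bacto-tryptone, 2.5 g Bacto-yeast extract, 5 g NaCl, and 25 mg ampicillin.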

The following procedure is adapted from Maniatis et al., 1982, Molecular Cloning (Cold Spring Harbor Laboratory).

The cells were harvested by centrifugation at 4000 g for 10 minutes at 4° C., and the supernatant was discarded. The cell pellet was washed in 100 ml of ice-cold STE buffer (0.1M NaCl; 10 mM Tris-HCl, pH 7.8; and 1 mM EDTA). After washing, the cell pellet was resuspended in 10 ml of Solution 1 (50 mM glucose; 25 mM Tris-HCl, pH 8.0; and 10 mM EDTA) containing 5 mg/ml lysozyme and left at room temperature for 10 minutes. Twenty ml of Solution 2 (0.2N NaOH and 1% SDS) were then added to the lysozyme-treated cells, and the solution was gently mixed by inversion. The mixture was incubated on ice for 10 minutes.

Fifteen ml of ice-cold 5M potassium acetate, pH 4.8, were added to the lysed-cell mixture and the solution mixed by inversion. The solution was incubated on ice for 10 minutes. The 5M potassium acetate solution was prepared by adding 11.5 ml of glacial acetic acid to 28.5 ml of water and 60 ml of 5M potassium acetate; the resulting solution is 3M with respect to potassium and 5M with respect to acetate.
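The stated composition of the potassium acetate solution (3M potassium, 5M acetate) can be checked arithmetically. The following sketch is illustrative only: it assumes that glacial acetic acid is about 17.4M and that the volumes are additive, neither of which is stated in the text, and the function name is hypothetical.

```python
# Assumption (not stated in the text): glacial acetic acid is about 17.4 mol/l.
GLACIAL_ACETIC_ACID_M = 17.4


def potassium_acetate_mix(koac_ml=60.0, koac_molar=5.0,
                          acid_ml=11.5, water_ml=28.5):
    """Return total volume and molarities of potassium and acetate.

    Volumes are assumed to be additive, which is an approximation.
    """
    total_ml = koac_ml + acid_ml + water_ml
    potassium_mmol = koac_ml * koac_molar              # ml x mol/l = mmol
    acetate_mmol = potassium_mmol + acid_ml * GLACIAL_ACETIC_ACID_M
    return {
        "total_ml": total_ml,
        "potassium_M": potassium_mmol / total_ml,
        "acetate_M": acetate_mmol / total_ml,
    }


if __name__ == "__main__":
    print(potassium_acetate_mix())
```

Under these assumptions the 60 ml of 5M potassium acetate contributes 300 mmol of potassium and acetate, the 11.5 ml of glacial acetic acid contributes about 200 mmol of additional acetate, and the final volume is 100 ml, which is consistent with the 3M and 5M figures given above.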

The lysed cell mixture was centrifuged in a Beckman SW27 (or its equivalent) at 20,000 rpm for 20 minutes at 4° C. The cell DNA and debris formed a pellet on the bottom of the tube. About 36 ml of supernatant were recovered, and 0.6 volumes of isopropanol were added, mixed, and the resulting solution left at room temperature for 15 minutes. The plasmid DNA was collected by centrifugation at 12,000 g for 30 minutes at room temperature. The supernatant was discarded, and the DNA pellet was washed with 70% ethanol at room temperature. The ethanol wash was decanted, and the pellet was dried in a vacuum desiccator. The pellet was then resuspended in 8 ml of TE buffer (10 mM Tris-HCl, pH 8.0, and 1 mM EDTA).

Eight grams of CsCl were added to the DNA solution. About 0.8 ml of a 10 mg/ml solution of ethidium bromide in water were added for each 10 ml of CsCl-DNA solution. The final density of the solution was about 1.55 g/ml, and the ethidium bromide concentration was about 600 μg/ml. The solution was transferred to a Beckman Type 50 centrifuge tube, filled to the top with paraffin oil, sealed, and centrifuged at 45,000 rpm for 24 hours at 20° C. After centrifugation, two bands of DNA were visible in ordinary light. After removing the cap from the tube, the lower DNA band was removed by using a syringe with a #21 hypodermic needle inserted through the side of the centrifuge tube.

The ethidium bromide was removed by several extractions with water-saturated 1-butanol. The CsCl was removed by dialysis against TE buffer. After extractions with buffered phenol and then chloroform, the DNA was precipitated, washed with 70% ethanol, and dried. About 1 mg of plasmid pLPC-FLIN was obtained and stored at 4° C. in TE buffer at a concentration of about 1 μg/μl. A restriction site and function map of plasmid pLPC-FLIN is presented in FIG. 5 of the accompanying drawings. In the same manner, plasmids pLPC-FN, pLPC-SC, pLPC-LIN and pLPC-N are isolated from their corresponding host cells, also available from the NRRL. Restriction site and function maps of each of these plasmids are presented in the accompanying drawings.

EXAMPLE 2 Construction of Plasmid pGT-FLIN

Plasmids pLPC-N, pLPC-FN, pLPC-SC, pLPC-LIN and pLPC-FLIN may be directly transformed into eukaryotic host cells (preferably 293 cells) for the production of high levels of human protein C zymogens. Even higher levels of expression and secretion of product may be obtained if the gene encoding the mutant zymogen is driven by the GBMT transcription unit. The GBMT transcription unit is fully described in Grinnell et al., U.S. patent application Ser. No. 07/484,082, filed herewith on even date, the entire teaching of which is herein incorporated by reference.

Plasmid pGTC is one such vector, wherein the wild type human protein C zymogen gene is driven by the GBMT transcription unit. The wild type protein C gene can be easily removed from the vector on a BclI restriction fragment and any of the genes of the present invention can be inserted into the vector on a BclI restriction fragment. Digestion of plasmid DNA with BclI is inhibited by methylation at adenine in the sequence 5'-GATC-3'. Therefore, plasmid pGTC was prepared from E. coli host cells that lack an adenine methylase, such as that encoded by the dam gene, the product of which methylates the adenine residue in the sequence 5'-GATC-3'. E. coli K12 GM48 (NRRL B-15725) lacks a functional dam methylase and so is a suitable host to use for the purpose of preparing plasmid pGTC DNA for use as starting material in the construction of plasmid derivatives.

E. coli K12 GM48 cells were cultured and made competent for transformation, and plasmid pGTC was used to transform the E. coli K12 GM48 cells in substantial accordance with the procedure of Example 1. The transformed cells were plated on L-agar containing ampicillin, and once the ampicillin-resistant, E. coli K12 GM48/pGTC transformants had formed colonies, one such colony was used to prepare plasmid pGTC DNA in substantial accordance with the procedure of Example 1. About 1 mg of plasmid pGTC DNA was obtained and suspended in about 1 ml of TE buffer. Similarly, plasmids pGT-h and pGT-d can be prepared to allow BclI digestion. Plasmid pGT-d comprises the GBMT transcription unit with no gene at the BclI site, so that any gene can be easily inserted. Plasmid pGT-d also comprises the murine dhfr gene so that any transformant can be selected or amplified using the methotrexate resistance phenotype. Plasmid pGT-h comprises the GBMT transcription unit, a BclI site for easy insertion of a gene of interest and the hygromycin resistance-conferring gene. E. coli K12 AG1 strains comprising each of these plasmids were deposited with the NRRL on Jan. 18, 1990. The strains are available under the accession numbers NRRL B-18591 (for E. coli K12 AG1/pGT-d), NRRL B-18592 (for E. coli K12 AG1/pGT-h), and NRRL B-18593 (for E. coli K12 AG1/pGTC). Restriction site and function maps of these plasmids are presented in the accompanying drawings.

About 10 μl of the plasmid pLPC-FLIN DNA prepared in Example 1 are mixed with 20 μl 10x BclI restriction buffer (100 mM Tris-HCl (pH 7.4), 1.5M KCl, 100 mM MgCl₂ and 10 mM DTT), 20 μl mg/ml BSA, 5 μl restriction enzyme BclI (˜50 Units, as defined by Bethesda Research Laboratories (BRL), from which all restriction enzymes used herein are obtained), and 145 μl of water, and the resulting reaction is incubated at 37° C. for 2 hours. Restriction enzyme reactions described herein are routinely terminated by phenol and then chloroform extractions, which are followed by precipitation of the DNA, an ethanol wash, and resuspension of the DNA in TE buffer. The digested DNA is then electrophoresed through a 1% agarose prep gel and the about 1400 base pair restriction fragment comprising the mutant gene is purified using a BioRad Prep-A-Gene Kit, according to the manufacturer's instructions.
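As a check on the BclI digestion mixture described above, the component volumes can be summed and the 10x buffer diluted accordingly. The sketch below is illustrative and not part of the original procedure; the dictionary and function names are hypothetical, while the volumes and 10x buffer composition are those given in the text.

```python
# Component volumes (microliters) of the BclI digest, as given in Example 2.
DIGEST_VOLUMES_UL = {
    "plasmid pLPC-FLIN DNA": 10.0,
    "10x BclI buffer": 20.0,
    "BSA": 20.0,
    "BclI enzyme": 5.0,
    "water": 145.0,
}

# Composition of the 10x BclI restriction buffer (mM), as given in Example 2.
BCLI_10X_BUFFER_MM = {
    "Tris-HCl": 100.0,
    "KCl": 1500.0,
    "MgCl2": 100.0,
    "DTT": 10.0,
}


def total_volume_ul(volumes):
    """Sum the component volumes of a reaction."""
    return sum(volumes.values())


def final_buffer_mm(stock_mm, buffer_ul, total_ul):
    """Return final buffer component concentrations (mM) after dilution."""
    return {name: conc * buffer_ul / total_ul
            for name, conc in stock_mm.items()}


if __name__ == "__main__":
    total = total_volume_ul(DIGEST_VOLUMES_UL)
    print(total, "microliters total")
    print(final_buffer_mm(BCLI_10X_BUFFER_MM,
                          DIGEST_VOLUMES_UL["10x BclI buffer"], total))
```

The components total 200 μl, so the 20 μl of 10x buffer is diluted tenfold to 10 mM Tris-HCl, 150 mM KCl, 10 mM MgCl₂ and 1 mM DTT.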

Plasmid pGTC is then isolated from E. coli K12 AG1/pGTC (NRRL B-18593) in substantial accordance with the teaching of Example 1 and prepared from GM48 cells as in Example 2. Plasmid pGTC DNA is then digested with restriction enzyme BclI as taught above, then the large vector fragment is isolated and purified. This vector fragment is brought up to 90 μl volume with TE (pH 8.0), then 10 μl (0.05 Unit) of Calf Intestine Alkaline Phosphatase is added to dephosphorylate the vector ends. The mixture is incubated at 37° C. for 30 minutes, then 10 μl of 500 mM EGTA is added and the reaction is incubated at 65° C. for 45 minutes to inactivate the enzyme. The reaction is then phenol/chloroform extracted, ethanol precipitated, washed and resuspended in 20 μl of water.

About 7 μl (10 ng) of the BclI-digested vector backbone is then mixed with about 1 μl (100 ng) of the about 1400 base pair BclI restriction fragment of plasmid pLPC-FLIN, 1 μl 10X ligase buffer (0.5M Tris-HCl (pH 7.6), 100 mM MgCl₂, 100 mM DTT and 500 μg/ml BSA) and 1 μl T4 DNA ligase. The ligation reaction is then incubated for 12 to 16 hours at 16° C. The ligation reaction can lead to plasmids which contain the mutant zymogen gene oriented for transcription from the GBMT transcription unit, or plasmids wherein the gene is ligated in the opposite direction. Those plasmids which contain the gene in the proper orientation for transcription are designated plasmid pGT-FLIN.
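The ligation described above uses a tenfold mass excess of insert over vector. The corresponding molar excess depends on the length of the BclI-digested pGTC vector fragment, which is not stated in the text. The sketch below is therefore illustrative only: the function name is hypothetical, and the 6000 base pair vector length used in the example call is an arbitrary placeholder, not a property of plasmid pGTC.

```python
def insert_to_vector_molar_ratio(insert_ng, insert_bp, vector_ng, vector_bp):
    """Return the molar ratio of insert to vector in a ligation.

    For double-stranded DNA, moles are proportional to mass divided by
    length, so the per-base-pair molecular weight cancels out of the ratio.
    """
    return (insert_ng / insert_bp) / (vector_ng / vector_bp)


if __name__ == "__main__":
    # 100 ng of the ~1400 bp BclI fragment and 10 ng of vector, per the text.
    # HYPOTHETICAL: the 6000 bp vector length is a placeholder assumption.
    ratio = insert_to_vector_molar_ratio(insert_ng=100, insert_bp=1400,
                                         vector_ng=10, vector_bp=6000)
    print(round(ratio, 1), "mol insert per mol vector")
```

The four components listed (7 μl, 1 μl, 1 μl and 1 μl) total 10 μl, so the 10X ligase buffer is at 1X in the final reaction.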

Frozen competent E. coli K12 AG1 cells are obtained from Stratagene, 3770 Tansey Road, San Diego, Calif. 92121. About 5 μl of the ligation reaction is mixed with a 100 μl aliquot of competent cells, then the cell-DNA mixture is incubated on ice for one hour, heat-shocked at 42° C. for 45 seconds, then chilled on ice for about 2 minutes. The cell-DNA mixture is diluted into 1.0 ml of LB media in a Falcon 2059 tube and incubated at 37° C. for one hour. One hundred microliter aliquots are plated on LB-agar plates containing ampicillin and incubated at 37° C. until colonies appear.

The colonies are individually cultured, and the plasmid DNA of the individual colonies is examined by restriction enzyme analysis. Plasmid DNA isolation is performed on a smaller scale in accordance with the procedure of Example 1, but the CsCl step is omitted until the proper E. coli K12 AG1/pGT-FLIN transformants are identified. At that time, a large scale, highly purified plasmid prep is performed. Following the teaching of Examples 1 and 2, any of the mutant zymogen genes can easily be cloned into any of the GBMT vectors.

EXAMPLE 3 Construction of Adenovirus-transformed Human Embryonic Kidney Cell Line 293 and Adenovirus-transformed Syrian Hamster Cell Line AV12 Transformants Using Plasmid pGT-FLIN

Human Embryonic Kidney Cell Line 293 is available from the American Type Culture Collection under the accession number ATCC CRL 1573. The adenovirus-transformed Syrian hamster cell line AV12 is also available from the American Type Culture Collection under the accession number ATCC CRL 9595. The transformation procedure described below refers to 293 cells as the host cell line; however, the procedure is generally applicable to most eukaryotic cell lines, including the AV12 cell line, and to the expression vectors of the invention.

293 cells are obtained from the ATCC under the accession number CRL 1573 in a 25 cm² flask containing a confluent monolayer of about 5.5×10⁶ cells in Eagle's Minimum Essential Medium (Gibco) with 10% heat-inactivated horse serum. The flask is incubated at 37° C.; medium is changed twice weekly. Media is composed of DMEM (Gibco) supplemented with 10% fetal calf serum, 50 μg/ml gentamicin, and 10 μg/ml AquaMEPHYTON® phytonadione vitamin K₁ (Merck Sharp and Dohme, Merck and Co., Inc., West Point, Pa. 19486). The cells are subcultured by removing the medium, rinsing with Hank's Balanced Salts solution (Gibco), adding 0.25% trypsin (containing 0.2 g/L EDTA) for 1-2 minutes, rinsing with fresh medium, aspirating, and dispensing into new flasks at a subcultivation ratio of 1:5 or 1:10.

One day prior to transformation, cells are seeded at 0.7×10⁶ cells per 100 mm dish. Sterile, ethanol-precipitated plasmid DNA dissolved in water is used to prepare a 2X DNA-CaCl₂ solution containing 25 μg/ml of the transforming plasmid DNA and 250 mM CaCl₂. 2X HBSS is prepared containing 280 mM NaCl, 50 mM Hepes, and 1.5 mM sodium phosphate, with the pH adjusted to 7.05-7.15. The 2X DNA-CaCl₂ solution is added dropwise to an equal volume of sterile 2X HBSS. A one ml sterile plastic pipette with a cotton plug is inserted into the mixing tube that contains the 2X HBSS, and bubbles are introduced by blowing while the DNA is being added. The calcium-phosphate-DNA precipitate is allowed to form without agitation for 30-45 minutes at room temperature.
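Because the 2X DNA-CaCl₂ solution and the 2X HBSS are combined in equal volumes, each component is present at half its 2X concentration in the precipitation mixture. The following sketch merely restates that arithmetic; the names used are hypothetical, and the concentrations are those given above.

```python
# 2X DNA-CaCl2 solution, as described in Example 3.
DNA_CACL2_2X = {
    "plasmid DNA (ug/ml)": 25.0,
    "CaCl2 (mM)": 250.0,
}

# 2X HBSS, as described in Example 3.
HBSS_2X = {
    "NaCl (mM)": 280.0,
    "Hepes (mM)": 50.0,
    "sodium phosphate (mM)": 1.5,
}


def mix_equal_volumes(solution_a, solution_b):
    """Return concentrations after mixing two solutions 1:1 by volume.

    Each solution is assumed to contain none of the other's components,
    so every concentration is simply halved.
    """
    combined = {**solution_a, **solution_b}
    return {name: conc / 2.0 for name, conc in combined.items()}


if __name__ == "__main__":
    for name, conc in mix_equal_volumes(DNA_CACL2_2X, HBSS_2X).items():
        print(name, conc)
```

The resulting mixture contains 12.5 μg/ml plasmid DNA, 125 mM CaCl₂, 140 mM NaCl, 25 mM Hepes and 0.75 mM sodium phosphate.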

The precipitate is then mixed by gentle pipetting with a plastic pipette, and one ml (per plate) of precipitate is added directly to the 10 ml of growth medium that covers the recipient cells. After 4 hours of incubation at 37° C., the media is replaced with fresh media and the cells allowed to incubate for an additional 72 hours before providing selective pressure. For plasmids that do not comprise a selectable marker that functions in eukaryotic cells, such as plasmid pGT-FLIN, the transformation procedure utilizes a mixture of plasmids: the expression vector of the present invention that lacks a selectable marker; and an expression vector that comprises a selectable marker that functions in eukaryotic cells. A variety of vectors are available for use in such cotransformation systems and include plasmids pSV2-dhfr (ATCC 37146), pSV2-neo (ATCC 37149), pSV2-gpt (ATCC 37145), and pSV2-hyg (NRRL B-18039). Plasmid pSV2-hyg confers resistance to hygromycin B to eukaryotic host cells. This co-transformation technique allows for the selection of cells that contain the plasmid with the selectable marker. These cells are further examined to identify cells that comprise both of the transforming plasmids. Of course, the present invention also comprises expression vectors that contain a selectable marker for eukaryotic cells and thus do not require use of the cotransformation technique.

For cells transfected with plasmids containing the hygromycin resistance-conferring gene such as plasmid pGT-FLIN-h, hygromycin B is added to the growth medium to a final concentration of about 200 μg/ml. The cells are then incubated at 37° C. for 2-4 weeks with medium changes at 3 to 4 day intervals. The resulting hygromycin-resistant colonies are transferred to individual culture flasks for characterization. Plasmid pSV2-neo confers resistance to neomycin (G418 is also used in place of neomycin), and selection of G418-resistant colonies is performed in substantial accordance with the selection procedure for hygromycin-resistant cells, except that G418 is added to a final concentration of 400 μg/ml.

The use of the dihydrofolate reductase (dhfr) gene or the methotrexate resistance-conferring derivative of the dhfr gene (dhfr-mtx) as a selectable marker for introducing a gene or plasmid into a dhfr-deficient cell line and the subsequent use of methotrexate to amplify the copy number of the plasmid has been well established in the literature. 293 cells are dhfr positive, so 293 transformants that contain plasmids comprising the dhfr gene are not selected solely on the basis of the dhfr-positive phenotype, which is the ability to grow in media that lacks hypoxanthine and thymidine. Cell lines that do lack a functional dhfr gene and are transformed with dhfr-containing plasmids can be selected for on the basis of the dhfr+ phenotype. Although the use of dhfr as a selectable and amplifiable marker in dhfr-producing cells has not been well studied, evidence in the literature would suggest that dhfr can be used as a selectable marker and for gene amplification in dhfr-producing cells. The present invention is not limited by the selectable marker used on expression vectors. Moreover, amplifiable markers such as metallothionein genes, adenosine deaminase genes, or members of the multigene resistance family, exemplified by the P-glycoprotein gene, can be utilized.

Transformation of the 293 and AV12 cell lines with a mixture of plasmid pGT-FLIN and a hygromycin resistance-conferring vector and subsequent selection for hygromycin-resistant cells yields a number of transformants. (Other transformants are obtained by using plasmid pSV2-neo as the cotransforming vector and selecting for G418-resistant cells.) The procedure in this example can be used for each of the HPC zymogen plasmids of the present invention.

EXAMPLE 4 Selection of Cells Secreting Human Protein C Zymogen Mutants

The hygromycin-resistant transformants obtained in Example 3 are grown on 100 mm tissue culture dishes at a density of several hundred cell clones per tissue culture dish. The media is decanted, and the cells are rinsed twice with 5 ml aliquots of Hank's Balanced salt solution (Gibco). A solution of sterile 0.45% agar (Sigma Type 4 agarose, catalogue #A3643, Sigma Chemical Co., P.O. Box 14508, St. Louis, Mo. 63178) is prepared by mixing 1 ml of 1.8% agar (47° C.) with 3 ml of Dulbecco's Modified Eagle's (DME) Salts (Gibco) (37° C.), and 2 ml of this 0.45% agar solution are layered over the cells.

Nitrocellulose filters (Schleicher and Schuell, Inc., Keene, N.H. 03431) are boiled and then autoclaved 2 hours to remove the wetting agent, which is toxic to the cells. The filters are then placed on top of the agar layer, and after air bubbles are removed, the plates are incubated at 37° C. for 1 to 3 hours. The filters, previously marked to indicate the original orientation of the filter on the dish so as to facilitate later identification of colonies, are then removed and placed in PBS (50 mM Tris-HCl, pH=7.2, and 150 mM NaCl).

To keep the cells on the dish viable during analysis of the filters, the cells are overlaid with 8 ml of a mixture containing 2 ml of 1.8% agar (47° C.), 2 ml of DME salts (37° C.), and 4 ml of DME salts with 20% fetal bovine serum (37° C.). The cells are then placed in a 37° C. incubator.
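The final agar and serum percentages of the two overlays described in this Example follow from simple volume-weighted averaging. The sketch below is illustrative only and assumes additive volumes; the function name `overlay_composition` is hypothetical.

```python
def overlay_composition(parts):
    """Return (total_ml, agar_percent, serum_percent) for a mixture.

    `parts` is a list of (volume_ml, agar_percent, serum_percent) tuples,
    one per component. Volumes are assumed to be additive.
    """
    total_ml = sum(volume for volume, _, _ in parts)
    agar = sum(volume * agar_pct for volume, agar_pct, _ in parts) / total_ml
    serum = sum(volume * serum_pct for volume, _, serum_pct in parts) / total_ml
    return total_ml, agar, serum


if __name__ == "__main__":
    # First overlay: 1 ml of 1.8% agar plus 3 ml of DME salts.
    print(overlay_composition([(1.0, 1.8, 0.0), (3.0, 0.0, 0.0)]))
    # Second overlay: 2 ml of 1.8% agar, 2 ml of DME salts, and
    # 4 ml of DME salts containing 20% fetal bovine serum.
    print(overlay_composition([(2.0, 1.8, 0.0), (2.0, 0.0, 0.0),
                               (4.0, 0.0, 20.0)]))
```

Both overlays come to 0.45% agar; the second, 8 ml overlay also contains 10% fetal bovine serum.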

All washes and reactions carried out on the filters are accomplished while the filters are on a rocking platform. The filters are first blocked by incubation at room temperature in 5% milk in PBS. The filters are then rinsed (5 minutes/rinse) four times in PBS. A 10 μg/ml biotinylated goat anti-human protein C polyclonal antibody in 2.5% bovine serum albumin is added to the filter (in sufficient quantities to cover the filter), which is then incubated at 37° C. for 1 hour.

Purification of protein C, for subsequent use to prepare antibody against protein C, can be accomplished as described by Kisiel, 1979, J. Clin. Invest. 64:761. Polyclonal antibody can be prepared by the procedure disclosed in Structural Concepts in Immunology and Immunochemistry by E. A. Kabat, published in 1968 by Holt, Rinehart, and Winston. Monoclonal antibody, which is also suitable for use in the assay, can be prepared as disclosed in Kohler and Milstein, 1975, Nature, 256:495, or as disclosed in U.S. Pat. No. 4,696,895; EPO Pub. No. 205046; Laurell et al., 1985, FEBS 191(1):75; Suzuki et al., 1985, J. Biochem. 97:127-138; and EPO Pub. No. 138222. The avidin D and biotinylated horseradish peroxidase (HRP) used in the assay are obtained in a Vectastain™ kit (Vector Laboratories, Inc., 30 Ingold Road, Burlingame, Calif. 94010). Biotin is also obtained from Vector Laboratories, Inc.

The filters are rinsed four times with PBS at 4° C. Then, avidin D and biotinylated horseradish peroxidase are prepared and added as per the manufacturer's instructions in the Vectastain™ (Vector Laboratories) kit. The filters are incubated with the HRP-conjugated avidin D for 1 hour at 4° C. (longer incubation times, i.e., overnight, can be used when small amounts of protein are being secreted); then, the filters are rinsed four times with PBS at 4° C.

To develop the indicator color on the filters, about 30 mg of HRP color-development reagent (4-chloro-1-naphthol, Sigma) dissolved in ice-cold 100% methanol are added to 50 ml of PBS and 30 μl of 30% H₂O₂. This mixture is added to the nitrocellulose filters, which are incubated at room temperature until the color develops. Colonies secreting the most human protein C zymogen of the invention will be indicated on the filters not only by earliest appearance of the color but also by darker spots on the filter.

After the filters have been developed, they are realigned with the original plates to determine which colonies are associated with which spots on the filter. The colonies secreting the most human protein C zymogen of the invention are then selected and used for production of the zymogen.

Those skilled in the art will recognize that the above assay is merely illustrative of the method of identifying high secreting cell lines. A variety of assay procedures can be successfully employed in the method. For instance, a double-antibody reaction can be employed in which the biotinylated goat anti-protein C antibody is replaced with a goat anti-protein C antibody (IgG) and a biotinylated anti-goat IgG antibody.

The zymogen mutants may be purified from the cell cultures. The supernatant is removed from cells expressing the recombinant product and purified on a Pharmacia Fastflow-Q column. About 1 ml of the resin is equilibrated with 20 mM Tris-HCl (pH 7.4), 0.15M NaCl, 5 mM EDTA and 4 mM benzamidine. The culture supernatant is brought to pH 7.4 by the addition of Tris-HCl (pH 8.0) and brought to 5 mM EDTA and 4 mM benzamidine. The supernatant is loaded onto the resin in a column and washed with three column volumes of Tris-HCl (pH 7.4), 0.15M NaCl. The recombinant product is eluted from the column using an elution buffer containing 10 mM CaCl₂, 150 mM NaCl in 20 mM Tris-HCl (pH 7.4).

Specific activity of the product is determined according to the procedure of Grinnell et al., (1987), Biotechnology 5:1189-1192 as follows: concentrated product from the column eluate is first activated with an immobilized thrombin-thrombomodulin complex. The amidolytic activity of the product is measured by the hydrolysis of a tripeptide substrate S-2366 (obtained from Helena Labs). The anticoagulant activity of the product is determined by the prolongation of an activated partial thromboplastin time using reagents from Helena.

We claim:
 1. A DNA compound wherein the polypeptide encoded by said DNA is: ##STR22##
 2. A recombinant DNA expression vector comprising the DNA compound of claim 1.
 3. The vector of claim 2 that is plasmid pLPC-N.
 4. The vector of claim 3 that is plasmid pFT-N.
 5. A DNA compound wherein the polypeptide encoded by said DNA is: ##STR23## wherein R₁ is ASP, R₂ is GLN, R₃ is GLU, R₄ is ASP, R₅ is GLN, R₆ is VAL, R₇ is PHE, R₈ is PRO, R₉ is ARG, R₁₀ is LEU, R₁₁ is a deletion and R₁₂ is ASN.
 6. A recombinant DNA expression vector comprising the DNA compound of claim 5.
 7. The vector of claim 6 that is plasmid pLPC-FN.
 8. A DNA compound wherein the polypeptide encoded by said DNA is: ##STR24## wherein R₁ is LEU, R₂ is HIS, R₃ is LYS, R₄ is LEU, R₅ is GLN, R₆ is THR, R₇ is TYR, R₈ is PRO, R₉ is ARG, R₁₀ is THR, R₁₁ is a deletion and R₁₂ is ASN.
 9. A recombinant DNA expression vector comprising the DNA compound of claim 8.
 10. The vector of claim 9 that is plasmid pLPC-SC.
 11. A eukaryotic host cell transformed with a vector of claim 5.
 12. The eukaryotic host cell of claim 11 that is 293/pLPC-N.
 13. The eukaryotic host cell of claim 11 that is 293/pGT-N.
 14. A eukaryotic host cell transformed with a vector of claim 9.
 15. The eukaryotic host cell of claim 14 that is 293/pLPC-FN.
 16. A eukaryotic host cell transformed with a vector of claim 12.
 17. The eukaryotic host cell of claim 16 that is 293/pLPC-SC.
 18. A method for the recombinant production of a zymogen form of human protein C upon secretion from a eukaryotic host cell, which comprises (A) transforming a eukaryotic host cell with a recombinant DNA vector, said vector comprising: (i) a DNA sequence that encodes an amino acid residue sequence, said amino acid residue sequence comprising, from the amino terminus to the carboxy terminus: a) a signal peptide and pro-peptide of a gamma-carboxylated, secreted protein; b) the light chain of human protein C; c) a dipeptide selected from the group consisting of LYS-ARG, ARG-LYS, LYS-LYS, and ARG-ARG; and d) the amino acid residue sequence: ##STR25## wherein R₁ is selected from the group consisting of ASP and LEU, R₂ is selected from the group consisting of GLN and HIS, R₃ is selected from the group consisting of GLU and LYS, R₄ is selected from the group consisting of ASP and LEU, R₅ is GLN, R₆ is selected from the group consisting of ASP, PHE and TYR, R₈ is PRO, R₉ is ARG, R₁₀ is selected from the group consisting of LEU and THR, R₁₁ is a deletion, and R₁₂ is ASN and --COOH is the carboxy terminus; (ii) a promoter positioned to drive expression of said DNA sequence; (B) culturing said host cell transformed in step (A) under conditions that allow for expression of said DNA sequence, and (C) recovering said zymogen form of protein C from said culture.
 19. The method of claim 18 wherein R₁ is ASP, R₂ is GLN, R₃ is GLU, R₄ is ASP, R₅ is GLN, R₆ is VAL, R₇ is ASP, R₈ is PRO, R₉ is ARG, R₁₀ is LEU, R₁₁ is a deletion and R₁₂ is ASN.
 20. The method of claim 19 wherein the recombinant DNA expression vector is plasmid pLPC-N.
 21. The method of claim 19 wherein the recombinant DNA expression vector is plasmid pGT-N.
 22. The method of claim 18 wherein R₁ is ASP, R₂ is GLN, R₃ is GLU, R₄ is ASP, R₅ is GLN, R₆ is VAL, R₇ is PHE, R₈ is PRO, R₉ is ARG, R₁₀ is LEU, R₁₁ is a deletion and R₁₂ is ASN.
 23. The method of claim 22 wherein the recombinant DNA expression vector is plasmid pLPC-FN.
 24. The method of claim 18 wherein R₁ is LEU, R₂ is HIS, R₃ is LYS, R₄ is LEU, R₅ is GLN, R₆ is THR, R₇ is TYR, R₈ is PRO, R₉ is ARG, R₁₀ is THR, R₁₁ is a deletion and R₁₂ is ASN.
 25. The method of claim 24 wherein the recombinant DNA expression vector is plasmid pLPC-SC. 